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ABSTRACT ' . 

Two issues in the controversy over the use, 
interpretation, validity, and relevance of student ratings are 
explored here: the definition of preferred teacher behavior in terms 
of the rating instrument used, and the relationship between the 
student^s expected grade, grade point average, and the ratings 
recorded. The objectives of the present research were threefold: (1) 
to develop a short rating scale and associated diagnostic checklist; 
(2) to examine the construct validity and reliability of the scale; 
and (3) to assess the relationships between selected student 
demographic data and ratings on the scale. After a pilot test, there 
appeared to be little advantage to using an observed/preferred 
behavior format, so that format was discontinued in further revision 
of the scale, A repeated measures analysis of variance indicated that 
discrepant groups, i.e., those whose actuTal grade was two or more 
letter grades away from his/her expected grade, should be omitted 
from computations to increase the validity of student ratings. 
Further research was suggested to determine whether demographic 
variables interact with group discrepancy to influence evaluations of 
teacher behaviors. (Author/3W) 
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student Ratings: 
What is the Frame of Reference 



Anthony Lolli, Jr. and Steven V. Owen 
University of Connecticut 

Introduction , 

Controversy over the use, intrepretation, validity, and relevance 
of student ratings continues. One particular issue that has received 
relatively little attention is the student's evaluative standard or 
frame of reference. Users. of student ratings typically assume that 
students prefer the positive extreme in nearly all rating items. Yet 
students may not always prefer the extreme values in rating items. Not 
o; is there variability of preference among students within a single 
clas:;; there are also systematic shifts across classes in perceptions 
of the "ideal" teacher. These shifts appear to be strongly related to 
the student's major area of study (Slater and Owen, 1974). 

Levinthal et al . (1971) asserted that student ratings of teacher 
behaviors imply comparative judgments between the values of the observer 
and his observations. Levinthal and his colleagues found that judgments 
of ideal or "preferred" behaviors varied across students and items, as 
did interactions between observed and preferred responses. These re- 
searchers called for additional research and use of a two scale (observed/ 
preferred) rating format. 

It seems possible that the meaning and validity of rating instruments 
would improve if students were asked both what they observed and what 
they would prefer to observe. For example, if the mean observed rating 
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on a five-point item were, say, 4-0, the instructor could apparently 
improve a full point on that item. However, if he discovered that the 
mean preferred rating on the same item were 4.0, there is not much room 
for improvement. Indeed, if a rating of 4.0 is the preferred locus, 
then an observed rating higher than 4.0 could not accurately be regarded 
as an improvement. 

Another area concerned with the student's frame of reference was re- 
searched by Bausell and Magoon (1972). The question of whether a student's 
expected grade and GPA systematically affect subsequent teacher evaluations 
has not been answered clearly. The reason, in part, is that some studies 
have examined cases in which grades were awarded prior to the students • 
evaluation of their instructors, while other studies required students to 
evaluate their instructors prior to the awarding of grades. Bausell and 
i4agoon found that when students were divided into groups on the basis of 
discrepancies between their GPA and their expected grade that there was 
a systematic effect on subsequent ratings . As might be predicted, those 
students with expected grades higher than their GPA appeared to give in- 
flated ratings, while those students with low expected grades gave lew 
ratings. 

The objectives of the present research were threefold: 

1) To develop a short rating scale and associated diagnostic checklist; 

2) to examine the construct validity and reliability of the scale; and 

3) to assess the relationships between selected student demographic 
data and ratings on the scale. 

Pi 1 o t otudy 

A 10-item teacher evaluation instrument was constructed -3ing five 



point Likert questions pertaining to actual, obaerved teaching behaviors. 
In developing items, an effort waa made to compromise between high in- 
ference-statements (e.g., "attitude toward students'') and low inference 
questions (e.g., "me^s class on time")- Ten parallel items were con- 
structed on which^tudents indicated their preference for those behaviors 
/ 

(see Appendix^ A). Gagne'' and Allaire (cited in Grasha, 1975) suggested 

use of a discivioancy format (difference between observed and "ideal'' 
' xtings) could help to account for student rating variation. One of the 
p: .edurey conducted during this pilot tested this assertion in an incre- 

mci,.4.1 validity format. Observed item scores were regressed against two 
global, criterion items, resulting in multiple correlations of .86 and 
.70. Next, two multiple correlations were computed forcing the order of 
the predictor variables so that the observed ratings entered first, fol- 
lowed by difference scores (preferred items minus observed items) as a 
means ol' examining incremental validity. The resultant multiple correla- ^ 
tions of ..86 and .70 were identical to those regressions using the 
obi^erved items only. Thus, there was no unique criterion variance accounted 
for by the difference scores. The assertion of Gagne and Allaire was not 
supported in this pilot study. 

It was suggested, following intrepretat ions of the results of this 
pilot, that the observed/preferred format should not be employed for all 
questions since sufficiently large differences between the means of the 
observed and preferred items did not occur for all items. Lolli, Owen, 
and i-'roman {^^TJ) suggested that, in determining which rating items 
ffu.wilV/ lor an obr-.orved/pref erroJ J'onnat , field tests shouid be under- 
Laken Jor any new meai;ure{.:. A;; thfM'o appeai^ed to be little advantage 
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to using an observed/pref erred format with the current items, that for^ 
mat was discontinued in fui-ther revision of the scale* 

Revised Instrument 

Several of the pilot items were rewri Uen on the basis of difficulty 
in intrepretation, or because new items were believed to be more mean- 
ingful in terms of evaluative data. This revised instrument is shown in 
Appendix B. The demographic items appearing at the top of the instrument 
were included in order to assist in determining the validity of the instru- 
ment. Items 1 and 2 were designed to be global criterion items. An im^ 
portan: criticism of teacher rating' scales is the lack of external cri- 
terion measures which show that instructors who receive higher ratings 
do in fact produce higher amounts of learning in their students. Since 
external criterion measures are not immediately available, items 1 and 2 
were included as the best available measure.. 

I '^ms 3 through 13 were gleaned from existing rating instruments and 
from related literature. They represent issues which are suggested as 
important by recent research literature in the area of teacher effective-- 
ness • 

Hems 14 throu4:h 64 are included for diagnostic purposes and are in- 
tended to provide the individual instructo^r with feedback related to 
specific oehaviors which are particularly bothersome to students. It 
should be noted that items 14 through 64 are primarily the result of 
student input* Mar^y of these items nave appeared with high frequency 
in an open-ended ^'comments" section of a currently used rating scale. 
Notice that items 1 through 13 can be regarded as high inference measures, 
while,, items 14 through 64 are more typical of low inference measures. 
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The high inference items should be regarded as "evaluative" measures 
(say, for promotion or tenure decisions), while the low inference items 
are meant to be used as feedback for the improvement of instruction* 

Procedures , 

Because of the many different procedures in this study, each separ- 
ate description will be introduced with a brief research question that 
the procedure is intended to answer. The same format will be followed 
iri the presentation of results. The scale was given to 4,930 graduate 
and undergraduate students during the last week of the I975 fall semester 
The students were drawn from 12? classes across 84 departments. 

(What is the influence of expected grade, and GPA, on rating items?) 

To examine the construct validity, a series of multivariate analyses 
were undertaken. Two stepwise multiple regressions were run. First, 
the 11 evaluative items were regressed against the students* expected 
grades. The purpose of this procedure was to determine the effect of 
expected grades on the rating items. Next, the 11 evaluative items were 
.•egressed against the students' grade point average (GPA) to determine 
the effect of GPA on the rating items. 

(Do discrepant and non-discrepant groups yield si^ificantly different 
ratings ?) ' 

Third in this series of analyses, a Type I repeated measures ANOVA 
was run. The between-groups independent variable was student discrepant 
group membership (b main effect). The within-groups independent vari- 
ableu were the 11 evaluative items (A main effect). The dependent vari- 
abloH were the scores on the 11 evaluative items. ThiB design is shown 
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in Figure 1, and follows the suggestion of Greenhouse and Geisser (1959) 
for interpreting profiles of data. 



Figure 1 

Type I Repeated Measures Analysis of Variance for 
Discrepant Student Groups and Evaluative Items * 
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For the purpose of this analysis, a member of a discrepant group is 
any student whose QPR is two or more letter grades c way from his/her ex- 
pected grade. In other words, a student belongs to a discrepant group if 
that student: 

1) Is an **A" student but expects a C, D, or F; 

2) is a^ "B" student but expects a D or F; 

3) 10 a "C" student but expects an A or P; or 

4) iH a "D" student but expects an A or B. 

ti 
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The discrepant group was divided into two types: 

1) Students whose expected grade is two or more letter grades above 
their QPR ("positive" discrepant group), and 

2) students whose expected grade is two or more letter grades below 
their QPR (^negative" discrepant group). 

(What 2£e the psychologjcal dimensions underlying the evaluative portio n 
of this instrument ?) 

Next, a principal component analysis was performed on the evaluative 
items. Orthogonal and oblique rotations followed the component analysis. 
Alpha internal consistency estimates were calculated for the meaningful 
factors. 

(What is the influence of criterion item 2, and demographic items, on 
rating items ?) "~ — — 

Two stepwise multiple regressions were run against criterion item 2 
to test the adequacy of the instrument. First, the 11 evaluative items 
were regressed 'against criterion item 2 to determine the extent that the 
overall teacher rating question could be predicted from the 11 evaluative 
Items. Next, the demographic items (see Appendix B) were regressed 
against criterion item 2. The purpose here was to determine whether the 
several demographic items were related to the global criterion item 2. 

A preliminary analysis examined the correlation between criterion 
item 1 (How much have you learned in this course?) and subsequent actual 
mid-term exam scores for a sample of 169 students in an undergraduate 
educational psychology course. The correlation was .11. As the internal 
consistency estimate of the exam was .89, and the equivalence reliability 
estimate was .81, it was clear that student estimates of "amount learned" 
did hot overlap as much as expected with exam scores. Obviously, amount 
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learned does rot take into account the entering behavior (or knowledge) 
of the students. Perhaps a more accurate criterion would have been the 
residual gain between entry behavior and exit behavior. In any case, 
It was decided that item 1 was not a valid criterion measure and it was 
dropped from further analyses. 

Results 

Table 1 presents the means and standard deviations for the 11 evalua- 
tive items « 



Insert Table 1 



(jijfhat is the effect of expected grade , GPA, criterion item 2, and demo - 
g raphic items , on rating items ?) 

Table 2 summarizes the regression analyses and lists the dependent 
variables, independent variables, multiple correlation coefficients, and 
'•general outcomes" (the extent to which the dependent variable can be 
predicted from the independent variables) for each multiple regression. 



Insert Table 2 



(Do giscrepant and non-discrepant groups ^oeld significantly different 
ratings ?) 

The results of the repeated measures ANOVA are shown in Table 3. I. 
should be noted that the B main effect was significant beyond the .001 
level. Means for each group across each item are shown in Table 4. As 
a result of the significant A x B interaction the group- means acrosc.the 
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11 evaluative items are plotted in Figure 2. With the exception of the 
first evaluative item all interactions were ordinal. It will be shown 
in the report of ^the results of the factor' analysis that this item (evalu 
utivc item l) was a "splinter" item and did not load on either of the 
meaningful factors. 

Figure 2 

Type I Repeated Met'^ures Analysis of Variance for 
Discrepant Student Groups and Evaluative Items 
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Insert Table 3 
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Leaner 



Because of the ever present problem of missing data, four explora- 
tory principal, component analyses were run: two employing pairwise 
deletion and two using casewise deletion of data for both orthogonal 
and oblique solutions. The method of pairwise deletion was selected 
over casewije deletion because both options yielded highly similar load- 

i' 

ings and factor structures for each type of rotation, and because the 
pairwise deletion option included more information in its analysis. 
The oblique rotation was ultimately selected because it yielded a cl< 
analysis with less overlap of factors, and made better psychological 
sense. In addition, the primary (unrotated) axes were somewhat corre- 
lated with each other (r « .26). 

Table 4 contains the primary pattern loading matrix, oblique solu- 
tion for the eleven evaluative items, and alpha internal consistency ^ 
estimates. Only items with loadings of .30 or higher are included. 
Using a root criterion of unity, two dimensions were generated and 
both were meaningful enough to be retained. The two component solution 
accounted for 58.1 percent of the total variance associated with item 
interrelationships. The items of unit complexity (loading highly on 
only one factor) were given principal consideration in naming the 
factors. The content of the actual item stems was used to generate the 
factor descriptions. 



Casewise deletion removes all information from a subject if he is 
missing only. a single bit of data. Pairwise deletion, by contrast, 
removes only the particular bit of information missing. For a dis- 
cussion of benefits and limitations of each type of deletion, see 
Nie et al., 1975f PP- 353; 503-4. 
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Insert Table 4 



Factor I (47*5 percent of the total variance) was named Teaching 
Effectiveness. The item content defining the factor suggests that in- 
structors receiving a high score on this factor were perceived by their 
students as displaying behaviors within the classroom which help them 
to make clear explanations of their well organized materials and pro- 
cedures. Factor II (10.6 percent of the total variance) was named Inter- 
action With Students, and reflects the quality of exchange that takes 
place between the instructor and his/her students considered apart from 
the quality of the instructor's teaching behaviors per se. 

Discussion , 

The results of the reliabilities of both dimensions which emerged 
from the factor analysis were reported in Table 4. The coefficients are 
adequate to warrant their use in examining student ratings of teacher be- 
havior through the implementation of this instrument. Good intentions to 
factor the ^1 diagnostic items were thwarted when a $98.00 computer run 
depleted our account. (The factor analysis was begun, but was aborted 
before rotations were performed.) This analysis will be completed shortly, 
and is meant to be a confirmatory factor analysis. That is, it will test 
hypotheses about which diagnostic iters "fit** with which evaluative, high 
inference items. It is hoped that the --diagnostic items will generate com- 
parable factor structures that will allow instructors being rated to ob- 
tain additional information from the diagnostic items that will be able to 
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further explain poor ratings on given evaluative items. In this way, 
the instructor who wishes to can use this inetrument as a basis for im- 
proving teaching effectiveness by targeting specific behaviors which are 
troublesome with respect to the frame of reference of the consumers (i.e., 
students) . . 

One of the more frequent criticisms of student ratings of teacher 
behaviors is that students who do well in a course will give the instructo 
a high rating, while students who do poorly will rate the instructor lower 
A similar, parallel argument might be made for students with high versus 
low CPAs. e respective multiple correlations of .25 and .07 for the 
dependent variables expected grade and GPA (predicted by the 11 evaluative 
items) provide only weak support for that criticism. However, when both 
expected grade and QPR are combined (discrepant groups) we find an im- 
portant, different conclusion. The group means, collapsing across the 
11 items are significantly different. In addition, the direction of the 
bias confoiTTis to the criticism stated above. The latter outcomes can be 
seen in the repeated measures analysis; in particular, refer to Figure 2. 

A probable explanation for the contradictory findings yielded from 
the two multiple regressions versus the results from the repeated measures 
ANOVA is that since +.he non-discrepant group is so much larger than either 
of ' the discrepant groups that in both multiple regressions the total 
effect is to "wash out" the biasing effect of the discrepant groups* 
ratings. This biasing effect can only be seen clearly when the groups 
are compared through the repeated measures ANOVA. 

It seems clear that the discrepancy between QPR and expected grade 
should be considered by investigators studying student ratings of teacher 
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behaviors. On th<p basis of these results, we suggest that the evalua- 
tions by students who fall into either of the discrepant groups be de- 
leted from the subsequent calculations. This will probably result in 
a more accurate picture of the resultant ratings since the validity of 
the student ratings as a group is increased. This proposed method of 
data reduction is especially important for those classes which have 
small numbers of students, since the biasing effect of discrepant eval- 
uations could be increased if there is a disproportionate ratio of dis- 
crepant students in that class. Such a ratio would result, in question- 
able evaluations of that particular instructor. One example that comes 
to mind is the course that is required, and taught for non-majors. It 
IS not unreasonable to expect that the students who find themselves faced 
with a new and unfamiliar body of knowledge might expect to do less well 
than their QPR would indicate. 

The results of 'he stepwise multiple regressions In which the demo- 
graphic items were regressed against criterion item 2 yielded a multiple 
correlation of .25 (low predictability). However, in light of the find- 
ings stated above, the relationship of demographic variables to student 
ratings is an area which requires additional research. For instance, 
the interaction between the strongest demographic variable and group 
discrepancy needs to be tested. 

Summary . 

A rating scale employing Likert-type questions, two criterion ques- ' 
tions, demographic items, and diagnostic items was constructed. The 
observed/preferred format, as a m^ans of examining the student's frame 

13 



16 



of reference appears to be of questionable value, at least for the evalua- 
tive items appearing on this scale. Principal component analyses yielded 
two factors (Teaching Effectiveness and Interaction with Students). 
Resulting Alpha internal consistency coefficients vere adequai;e to war- 
rant the use of the instrument. Evidence was presented which Bu<^,gested 
that while individual multiple regressions of evaluative items against l) 
QPRt and 2) expected grade demonstratect no substantial effect. The re- 
sults of a repeated measures ANOVA demonstrated a biasing effect on 
ratings by discrepant groups. It was suggested that in order to increase 
the validity of student ratings, the evaluations made by discrepant group 
members should be omitted from computations. Further research was sug- 
gested to determine whether demographic variables interact with group 
discrepancy to influence evaluations of teacher behaviors. 
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Table 2 

Summary of Stepwise Regression Analyses 



Dependent 
Variable 



Expected Grade 



QPK 



Criterion 2 
(rate the . 
ins txixctor) 

Ci'^torion 2 
(ja^.ve the 
:! instructor) 



Independent 
Variables 



iivaluative Items 
Evaluative Items 
Evaluative Items 

Demographic Items 



R 



General 
Outcome 



•25 Low Predict- 
ability 

.07 No Relation- 
ship 

•82 Strong 

Relationship 



.25 Low 

Predictability 
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Table 3 

Source Table for Repeated Measures ANOVA with Three 
Student Groups Across Evaluative Items 

Source ss df re P 

Between Ss 

B (discrepancy ^oups) 186. 54 2 93.27 18.99*** 

^vror 14606.28 2974 4.91 

Within Ss 

A (eleven items) 289-39 10 28.94 52.35*** 

^ 51.46 20 2.57 4.65* 

error I644O.36 2974O O.55 

p < .001. 
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Table 4 

Factor Loading Matrix for Evaluative Items Derived From 
Principle Component Analysis a'^d Oblique Solution''' 



Item Number Item Stem Loading 



Factor I : 


Teaching Effectiveness (Alpha estimate = , 


.78) 


4 


Presents material clearly and effectively 


.86 


5 


Demonstrates overall organization 


.86 


6 


Make course objectives clear 


.64 


7 


Fulfills class objectives 


.82 


8 


Makes assignments clear 


.53 


9 


Stimulates interest 


.64 


10 


Exam items stress important aspects 


.46 


Factor II: 


Interaction with Students (Alpha estimate 


= -65) 


11 


Grades fairly and impartially 




12 


Is accessible to students 


.79 


13 


Shows interest in and concern for students 


.76 


The entire 
only shows 


matrix is available from the senior autho: 
items which loaded .30 or higher. 


r; this table 
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Appendix A 

The University of Connecticut Survey of Courses and Teaching 

Demographic Information " ' ' 

Your semester:[][][][][][][][][ ]is this course in your major field-^f If 1 
1 2345678 Grad '^yes^^no- 

Expected grade in this course:[ ][ ][ ][ ][ ][ ] Cumulative grade average: 

A B C D P Pass' 1.0-i.5[] 1 .5-2. Of] 2.0-2.5[] 

Percent of class attendence: 0-.24^o[] 25-49J{i[] 2.5-3.0[] 3.0-3. 5;^] 3.5-4.0[] 
50-74fo[] 75-89'/o[] 90-loo/o[] 

Time spent on this course compared to other courses of equal credit: 
considerably more[] somewhat more[] about the same[ ] 

somewhat less[] considerably lessf] 
Observed/Preferred Items 



Vor the following ten questions, rate each item in two ways: 

a. Give your actual rating of the instructor; then 

b. mark the second scale according to how you would prefer things to b( . Of 
course, if you're sat isfied,the 2nd scale will be marked the same as the first, 



From my observations, this instructor: 



DM 



1. preLients material in a clear dc effective manner 

(preferred) 

2. litimulates interest 

(preferred) !...!..!!!![ 

3. makeu work assignments Sc student responsibilities'clear. 1 * 

(preferred) ^ ^ 

4. uses exam items which stress important aspects of :he course.... 

(preferred) 

5. grades fairly and impart ially 

(preferred) 

6. demonstrates overall organization 

(preferred) ^ ] 

7. meets class regularly and on time 

(preferred) 

8. fulfills class objectives and obligations 

(preferred) 

9. is accessible to students both in and out of class 

(preferred) 

10. shows an interest in and concern for students 

(preferred) ^ 



g 


n 






0 






3 






(0 




H-<| 


c+ 




CD a> 


H 


HO 




3 






es 


(0 



Global Criterion Items 



11. How much have you learned fro his course? 

almost nothing[j a little[] moaerate ajnoiint[] a great deal[] 

12. How would you rate this instructor in general? 

poor[] less than adequate[] average[] good[] excellent[] 



PP 

s:3 
po 

COC+ 
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Appendix B 

The University of Connecticut Survey of Courses and Teaching 



Department Coui^se 4 Section . Branch 



Instructor 



DIRECTIONS: PLEASE USE A NO. 2 PENCIL TO PILL IN APPROPRIATE BLOCKS. DO NOT MAKE STRAY MARKS. DO NOT 
SIGN YOUR NAME. ERASE CLEANLY IP YOU CHANGE AN ANSWER. RESULTS WILL NOT BE SEEN BY THE 
INSTRaCTOR BEFORE THE COURSE IS COMPLETED. 



Semester Standing 




Is Couroe in Your Major 


Expected Grade 


Cumulative Average How 


Often Did you Attend 








in This Course 


(QPR) 




Class 






Yes Q 


A CI 


Less than 10 


□ 


90-1 005& u 






No Ll 




10-14 




75-^9^ U 










15-19 




7-3 






D d 


20-24 




25-49?S b 


9 or nore ^ 








. 25-29 


0-24A' lJ 


Graduate J 






Pass M 


30-34 


I 










Audit □ 


35 or more 






•"J . iiow nruch have 


you 


learned from this course? 


U L 


U 


u 


U 






almost nothing a little 


a moderate amount 


quite a bit a great deal 


};0W would 70U 


rate 


this instructor in 


D u 


Li 


U 




* ^Tcieral all around 


teaching ability? 


poor less than adequate average 


good 


exce llent 



Ple.ij:f.' omit 'onor>e of me following items which are not appropriate 



In :nv opinion, this insti*ucto:' 



ca 


:i 


P 


0 


0 


h-* 


% 


CO 


3 




<-»■ 


0 






Vi 




0 


c* 


me 


— ) 




m 


<-»■ 


I—' 




rr 

















'J- 

10. 

1 1 . 

12. 

13. 



fleets clacs regularly and on time 

r'recento material in a clear and effective manner 

Demonctratec overall organization 

Makes purpose and objectives of course clear 

Fulfills class objectives 

Jtakes work assignments and student responsibility clear 
Stimulates interest 

Uses examination items which stress important aspects of course 
Grades fairly and impartially 

Is accessible to students both in and out of class 
Shows an interest in and concern for students 



J 

[ I 

» I 

a 

J 

d 















J 








J 


L- 




1 


J 








u 




J 


LJ 




J 




U 


U 




d 


u 


J 








J 


J 


u 


d 


u 


J 


J 


Li 


■ J 





:iag:jo3tic check list; 



TfiE ir^GTRUCTOR: 



The following is for the use of your instructor for the purpose of identifying areas of 
performance which need improvement, please check ONLY those areas which you found to be 
PARTICULARLY TROUBLESOME OR BOTHERSOME. 
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I. J was consistently late in starting or 

ending class 
).^was consintentl/ unprepared 
). _ had illegible handwriting 
'. J came to class intoxicated' or otherwise 

incapacitated 
i.'^had a dintracting smokinr habit 

seemed unfriendly toward students 
)• .J cri ticised and embarrassed students 
unfairly 

•J was patronizirtg, talked down to students 
I maintainea attitude of "I am always right" 
p. enforced pointless rules 

J seemed preoccupied with personal problems 
(.^neglected course for other activities 
».,^was too dogmatic or opinionated to present 

materials fairly 
• displayed favoritism toward certain students 
I. J displayed prejudice on the basis of race, sex, 

religion, ethnicity, etc. 
'.Q spoke' in a monotone 

■,Jwas difficult to hear or understand (mumbles, 

accent, inaudible) 
.J had distracting speech habits 
.J had distracting mannerisms (nervouanees, pacing, 54 

twitch, etc.) 

, J used too nuch profanity, vulgarity 55 
.J engaged in too much idle chit-chat 
..J junt read lectures 5^ 
,Q preeentod dull, boring lectures ' 57 

.[jdid not cover materials in d'jpth, tO: aupar- 
ficial 



39 
40 
41 
42 
43 

44 
45 

46, 

47 

48 

49 

50, 
51 

52 
53 



B 



61 



62 



Q presented too much detail or 
trivia 

did not stress important points . 
presented material at too fast a pace 
presented material at too slow a pace 
went over the same thing too often 
L) did not use enough illustrations, 
examples 

spent too much time on class discussions 
spent too little time on class 
discusBions 

D responded to questions in a vague and con-63 

fusing manner 
n assigned readif^g materials and then ignored 
them 64 
D did not explain what was expected on exams 

or assignments 
Qdid not provide enough feedback or comments 
on exBJDS or pap)ers 
was slow in returning graded work 
stressed rote memorization or trivia rather 
than understanding on exams 
J applied grading standards inconsistently 
grades too easily 

did not provide enough exama or other 
materials for evaluation 
[] required purchaae of coatly materials which 
were not worth the price 
aesigned too much "busy work" 
makes unreasonably long and burdensome 
aealgnments 



THIS COURSE ; 
• J too much smoking took 
place 

. J cheating was widespread 
. U enrollments were too 

large for effective 

instruction 
.y content was not 

sufficiently 

challenging 
.[J content was over my 

head, too difficult 
, J content did not match 

expectations, 

catalog description 
ijtoo much responsibility 

was delegated to 

graduate assistant 



a 



ERIC 
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